We have $6$ $6$ possible predictor variables in the data set to predict body_mass_g:
- species, island, bill_length_mm, bill_depth_mm, flipper_length_mm, sex

Encoding the categorical variables binary we have $p=8$ $p = 8$ possible predictors
- isAdelie, isGentoo , fromTorgersen, fromBiscoe, bill_length_mm, bill_depth_mm, flipper_length_mm, isFemale
from that we can build ${2^p} = {2^8}= 256$ $2^{p} = 2^{8} = 256$ possible linear models:
- $\text{body mass g}=\beta_0$
- $\text{body mass g}=\beta_0 + \beta_1 \times \text{isAdelie}$
- $\text{body mass g}=\beta_0+ \beta_2 \times \text{isGentoo}$
- ...

Training-Set	Validation-Set	Test-Set
used to train the models	used to select the model, predictors $\vec{X}$ and/or parameters	used to prove the models performance
5-fold CV (with Validation)	5-fold CV (with Training)	$15\%$ cut out at the beginning

2.5 Model Development